The Automation of Controlled Vocabulary Subject Indexing of Medical Journal Articles

Authors

  • David Roberts
  • Clive Souter
Abstract

The purpose of this article is to investigate the possibility of the automation of sophisticated subject indexing of medical journal articles. Approaches to subject descriptor assignment in information retrieval research are usually either based upon the manual descriptors included in the database or the attempted generation of alternative search parameters, using statistical, probabilistic or natural language methods or concept networks. A description of the principles of the Medline indexing system is presented, followed by a summary of the outcome of a pilot project, based upon the Amed database. The results suggest that a more extended study, based upon Medline, should encompass various components:

1. Extraction of "concept strings" from titles and abstracts of records, based upon a detailed analysis of linguistic features characteristic of the domain.
2. Mapping rules to associate the concept strings with entries in the Unified Medical Language System (UMLS, a consolidation of various medical vocabularies produced by the National Library of Medicine).
3. Coordination of the descriptors, utilising features of the Medline indexing system and a feedback mechanism relating to the original input.

The emphasis should be on system manipulation of data, based upon input, available resources and specifically designed rules, avoiding any implication of system "understanding".

Introduction

The inclusion of subject indexing terms in the records of bibliographic databases is a common practice. While the sophistication of indexing systems may vary considerably, the objective of this activity is to facilitate effective and comprehensive information retrieval from the databases. The purpose of this article is to consider various aspects relevant to the automation of sophisticated subject indexing. While the utility of such subject indexing may be contentious, the assumption underlying the article is that it does have a useful, even essential, function.
A brief overview of some general considerations regarding databases and indexing is presented. Reference is made especially to Medline, one of the best known databases, produced by the National Library of Medicine (NLM), as an exemplar of a database utilising a sophisticated subject indexing system. Next, some of the main lines of information retrieval research are briefly considered with respect to how this research has related to subject indexing. This is followed by a description of the manual subject indexing process, conducted by trained indexers, for the Medline database. The results of a pilot project investigating some aspects of the automation of the indexing process are then presented. These suggest that the possibility of …
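
The three components proposed in the abstract (extraction of concept strings, mapping to a controlled vocabulary, and coordination of descriptors) can be illustrated with a minimal rule-based sketch. Everything in it is an assumption made for illustration: `TOY_VOCABULARY` is a hypothetical stand-in for UMLS with invented entries, the n-gram extraction is far cruder than the linguistic analysis the authors describe, and the coordination rule (drop a descriptor contained in a longer one, then rank by frequency) is a simplification rather than the Medline procedure or the authors' method.

```python
import re
from collections import Counter

# Hypothetical miniature vocabulary standing in for UMLS. The keys and
# descriptor labels are illustrative only, not real UMLS records.
TOY_VOCABULARY = {
    "low back pain": "Low Back Pain",
    "back pain": "Back Pain",
    "acupuncture": "Acupuncture Therapy",
    "acupuncture therapy": "Acupuncture Therapy",
}

STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def extract_concept_strings(text, max_words=3):
    """Step 1: collect word n-grams that neither start nor end with a stopword."""
    words = re.findall(r"[a-z]+", text.lower())
    candidates = []
    for n in range(1, max_words + 1):
        for i in range(len(words) - n + 1):
            gram = words[i:i + n]
            if gram[0] in STOPWORDS or gram[-1] in STOPWORDS:
                continue
            candidates.append(" ".join(gram))
    return candidates


def map_to_descriptors(concept_strings, vocabulary):
    """Step 2: keep strings with an exact vocabulary entry and count the matches."""
    return Counter(vocabulary[s] for s in concept_strings if s in vocabulary)


def coordinate(descriptor_counts):
    """Step 3: drop descriptors contained in a longer selected label, rank the rest."""
    labels = list(descriptor_counts)
    kept = [
        d for d in labels
        if not any(d != other and d.lower() in other.lower() for other in labels)
    ]
    # Most frequent first; ties broken alphabetically for a stable order.
    return sorted(kept, key=lambda d: (-descriptor_counts[d], d))


def index_record(text, vocabulary=TOY_VOCABULARY):
    """Run the three steps on the title or abstract text of one record."""
    strings = extract_concept_strings(text)
    counts = map_to_descriptors(strings, vocabulary)
    return coordinate(counts)


if __name__ == "__main__":
    title = "Acupuncture for low back pain: acupuncture therapy in a pilot study"
    print(index_record(title))
```

The sketch only manipulates strings through lookup tables and fixed rules, in keeping with the abstract's emphasis on system manipulation of data rather than any form of "understanding".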


Similar resources

A Review of the Most Reputable Medical Microbiology Journals, Spring 1393

Background and Objective: Publishing articles in specialized journals that have prestigious indexing and international distribution helps scientists validate their research results. It may also raise the standing of the scientist and of the country where the research was done. Considering the importance of this issue, the present study aimed to i...


Content Based Radiographic Images Indexing and Retrieval Using Pattern Orientation Histogram

Introduction: Content Based Image Retrieval (CBIR) is a method of image searching and retrieval in a database. In medical applications, CBIR is a tool used by physicians to compare the previous and current medical images associated with patients' pathological conditions. As the volume of pictorial information stored in medical image databases is growing, efficient image indexing and retri...


Automatically Controlled-Vocabulary Indexing for Text Retrieval

The IR community has long worked on free-term indexing. By contrast, little effort has gone into controlled-vocabulary indexing. A new model for controlled-vocabulary indexing is proposed in this paper. The proposed model, TF×OSDF×CSIDF, distinguishes subject-specific words from common words and domain-specific words in documents. 60,400 MEDLINE records are used as training data and ...


A Study of the Correspondence between Users' Search Phrases and the Suggested Terms of Articles in the Bibliographic Records of the Latin Databases EBSCO and IEEE

Purpose: This study aims to investigate the correspondence of users' queries with the alternative terms of the Latin databases IEEE and EBSCO. Databases display the subject content of their documents through natural or controlled language vocabularies in specified bibliographic fields, along with other bibliographic information; these are called the papers' alternative terms. Methodology: We used content an...


A Comparison of the Thesaurus Structures of the Pubmed and Embase Databases with the Thesaurus Construction Standard of the U.S. National Information Standards Organization, and a Study of the Indexing Methods of the Two Databases

Introduction: According to mortality rates in Iran, cardiovascular diseases, neoplasms, perinatal mortality, and respiratory tract diseases were the leading causes of death in 2003 (1382). To reduce the mortality rate, the Iranian medical community needs to know more about recent therapeutic regimens. The two main medical databases are Pubmed and Embase. Researching Pubmed and Embase indexing methods and comparing Me...




Publication date: 2000